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Abstract. We have developed a new method to improve the transit detection of Earth-sized 
planets in front of solar-like stars by fitting stellar microvariability by means of a spot model. 
A large Monte Carlo numerical experiment has been designed to test the performance of our 
approach in comparison with other variability filters and fitting techniques for stars of different 
magnitudes and planets of different radius and orbital period, as observed by the space missions 
i | CoRoT and Kepler. Here we report on the results of this experiment. 
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1. Introduction 



We present a comparison among the performance of three methods applied to filter stellar 
variability for the detection of Earth-like planetary transits in the light curves of solar- 

^-j- ■ like stars. This requires two steps: first, the filtering of stellar variability to remove the 
effects of photospheric cool spots and bright faculae, whose visibility is modulated by 
stellar rotation; secondly, the search for transits in the filtered light curves by means of 

tyQ ' suitable detection algorithms. 

We recently proposed a filtering method based on a model of the flux variations of 
the Sun as a star, the so-called 3-spot model (Lanza et al. 2003). Its performance was 
compared with that of another method, the 200-harmonic fitting, by Bonomo & Lanza 
(2008). They showed that the 3-spot model has a better performance than the latter 
when the standard deviation of the noise is at least 2-4 times larger than the central 
depth of the transit. On the other hand, the 200-harmonic fitting is better when the 
standard deviation of the noise is comparable to the transit depth. 

Here we extend the comparison to the iterative non-linear filter by Aigrain & Irwin 
(2004). A comparison among different variability filters is important since only the cou- 
pling of the best filtering method with the best planetary transit detection algorithm 
allows us to maximize transit detection efficiency. This is especially relevant when we 
want to detect small, terrestrial planets, which is a challenge to CoRoT and Kepler 
missions. 
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2. Filtering methods 

The filtering methods we want to compare are: 

a) 200-harmonic fitting (Moutou et al. 2005, team 3): it fits stellar variability by means 
of a linear combination of 200 harmonic functions whose frequencies are multiples of the 
fundamental frequency = 1/2T, where T is the whole duration of the time series, i. 
e. T <~ 150 days in the case of the CoRoT mission; 

b) 3-spot model (Lanza et al. 2003, 2007): it is a simplified physical model of solar- 
like variability based on the rotational modulation of the flux produced by three active 
regions, containing both cool spots and warm faculae, plus a constant component to ac- 
count for uniformly distributed active regions. In the case of the Sun, the model accounts 
for the flux variability up to a time scale of 14 days, after which the position and areas 
of the three regions and the uniform component have to be changed; 

c) Iterative non-linear (INL) filter (Aigrain & Irwin 2004; Moutou et al. 2005, team 
5): it is based on the computation of a continuum by applying a sliding median-boxcar 
filter. Points where the difference between the continuum and the original light curve is 
greater than 3 standard deviations are flagged and the continuum is recomputed without 
the flagged points, iterating the process up to convergence. The final continuum is then 
subtracted from the original light curve. 

3. Light curve simulation and analysis 

We apply a Monte Carlo approach by simulating a large number of light curves of duration 
150 days (the extension of the CoRoT long runs) for different values of planetary radius R 
ranging from 1.0 to 2.0 Earth radii, orbital period P between 5 and 50 days, and standard 
deviation of the photon shot noise a from 100 to 1000 parts per millions (ppm). A noise 
level a = 100 ppm is obtained for a star of V ~ 12 observed in white light by CoRoT 
with 1 hr integration time, while a = 200, 300 and 1000 ppm corrispond to stars of 
V ~ 13, 14 and 16, respectively, observed with the same instrument and 1 hr integration 
time. The phase of the first transit is taken from a uniform random distribution. The 
star is assumed to have the solar radius and mass. We add stellar variability, assumed 
in all the cases to be given by the Total Solar Irradiance variations as observed close to 
the maximum of solar cycle 23 (e.g., Frohlich & Lean 2004). For each set of planetary 
parameters and noise level, we simulate 100 light curves with different noise and activity 
realizations, for a total of 8000 light curves. 

After filtering solar variability with the three different filtering methods, transits are 
searched by means of the BLS algorithm. The ratio of the transit depth to the noise 
level is indicated by a, whose statistics determine the confidence level of a given transit 
detection (see Kovacs et al. 2002). 

Transitless light curves are analysed in the same way to establish the transit detection 
threshold for each filtering method, by requiring a maximum false-alarm rate of 1 percent. 

4. Results and conclusions 

For a = 100 ppm, the filtering methods that achieve the best performance are the INL 
filter and the 200-harmonic fitting with detections up to 98 percent for R = 1.25 R ffi 
and P = 10 days. In most of the cases they give comparable results, although in some 
instances the INL filter has a slightly better performance, owing to the Gibbs phenomenon 
(Morse & Feshbach 1954) affecting the 200-harmonic fitting. 
When a ^ 200 ppm, the method with the best performance is the INL filter when 
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Figure 1. Left: Distributions of the values of a, the signal-to-noise ratio of a transit detection, 
obtained by analysing light curves with transits of a planet of 1.5 R®. Each set is characterized 
by a different orbital period of the planet (as labelled). The standard deviation of the white 
Gaussian noise is in all cases 200 ppm. The red and blue vertical dot-dashed lines indicate the 
1 percent false-alarm threshold for the 200-harmonic fitting and the 3-spot model, respectively. 
Red solid histograms show the statistics of light curves where the period P was correctly iden- 
tified after applying the 200-harmonic method, blue solid histograms show those after applying 
the 3-spot method. Dashed histograms refer to the statistics of light curves where the period P 
was incorrectly identified, with the same color coding. Right: As on the left, distributions of the 
values of a obtained analysing the simulated light curves after applying the 3-spot model (blue 
histograms) and the 1NL filter (green histograms). 

we use an appropriate window of 2 days for the median boxcar filter. It shows a per- 
formance comparable with that of the 3-spot model in most of the cases, even better in 
some instances (see Fig. [T]). On the other hand, the 200-harmonic fitting has the worst 
performance because of the Gibbs phenomenon (see Fig. [1] and [2]). 

The performance of the INL filter depends critically on the adopted extension of the 
filter window. An optimal window of 2 days has been chosen for our analysis. Shorter 
windows negatively affect the transit detection since they give rise to a reduction of the 
transit depth in the filtered light curve (see Fig. [3]), in which case the 3-spot model and 
the 200-harmonic fitting would prove to be the best methods for the cases with a ^ 200 
ppm and a ~ 100 ppm, respectively. 

The optimal width of the median filter window depends on the magnetic activity level 
of the star and its rotation period. Specifically, the higher the activity level and the 
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Figure 2. The shape of a transit of an Earth-like planet as it appears in the ideal case (solid 
black line), in the residuals obtained with the 3-spot model (dashed blue line) and in those 
of the 200-harmonic fitting (dotted red line). Note the reduction of the transit depth and the 
overshooting at the edges of the transit dip due to the Gibbs phenomenon in the case of the 
200-harmonic fitting. 
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Figure 3. Upper panel: one of the light curves with transits simulated for the First CoRoT 
Blind test (Moutou et al. 2005; ID=460). Middle panel: the light curve filtered by means of 
the INL filter with a window of 2 days. Bottom panel: the filtered light curve with a 0.75 day 
window. Note the disappearance of the transits when the window extension is reduced. 

shorter the rotation period, the shorter the optimal window, because the time scales of 
the flux variations decrease with increasing activity. In other words, in the case of highly 
active stars the window extension has to be shortened with respect to the solar case, 
otherwise some oscillations or transit-like features will appear in the residuals owing to 
a bad filtering of the variability. To fix automatically the window extension, we propose 
a method similar to that of Regulo et al. (2007), computing the power spectrum of the 
time scries and choosing an extension corresponding to the frequency where the power 
spectral density goes below a fixed threshold, usually set at 10 -6 of the maximum power 
level. 

We conclude that the INL filter, when applied with a suitable choice of its window, 
has a better performance than more complicated and computationally intensive methods 
of fitting solar-like variability, like the 200-harmonic fitting or the 3-spot model. 
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